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Express Mall Label No^ EL615486894US 

1 

IDENTIFICATION OF THE GENE CAUSING THE MOUSE SCURFY 
PHENOTYPE AND ITS HUMAN ORTHOLOG 

CROSS-REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority from U.S. Provisional Application No. 

60/096,195, filed August 1 1, 1998, vMch application is incorporated by reference in its 
entirety. 

TECHNICAL FIELD 

The present invention relates generally to pharmaceutical products and 
10 methods and, more specifically, to methods and compositions useful for diagnosing 
scurfy-related diseases, as well as methods for identifying compounds which can 
modulate the immune system. 

BACKGROUND OF THE INVENTION 

Inherited mutations affecting the murine unmune system have proven to 

15 be a rich source of novel genes critical to the regijation of the immune system and have 
furnished important animal models for human immunological disorders. These include 
xid, the murine equivalent of X-linked agammaglobulinemia (Thomas et al.. Science 
261:355, 1993; Rawlings et al.. Science 257:358, 1993), beige (the equivalent of 
Chediak-Higashi Syndrome) (Barbosa, et al.. Nature 382:262, 1996), Ipr and gld 

20 (defects in fas and fas-Ugand), X-linked severe combined immunodeficiency (Sugamura 
et al., Amu. Rev. Immunol. N:179, 1996), and the hematopoietic cell phosphatase 
mutant motheaten (SHP-1) (Bignon and Siminovitch, Clin Immunol Immmopathol 
75:168, 1994). 

One mouse mutant of particular interest is the as-yet uncloned X-linked 
25 mouse mutant, scurfy (sf). Briefly, mice hemizygous for the scurfy mutation exhibit a 
severe lymphoproliferative disorder. In particular, males hemizygous (X^Vv) for the 
scurfy mutation develop a progressive lymphocytic infiltration of the lymph nodes. 
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spleen, liver and skin resisting in gross morphological symptoms which include 
splenomegaly, hepatomegaly, greatly enlarged lymph nodes, runting, exfoliative 
dermatitis, and thickened malformed ears (Godfrey et ah, Amer. J. Pathol. 138:1379, 
1991; Godfrey et al., Proc. Ndtl Acad. Sci. USA §5:5528, 1991). Other clinical 

5 symptoms include elevated leukocyte counts, hypergaminaglobulinemia, and severe 
anemia (Lyon et al., Proc. Natl Acad. Sci. USA 57:2433, 1990); the death of affected 
males usually occurs by 3 weeks of age. The 5/locus has been mapped to the extreme 
proximal region of the X chromosome, approximately 0.7 centimorgans from the locus 
for sparse-fia- {spj) (Lyon et al., Proc. Natl. Acad. Sci. USA 57:2433, 1990; Blair et al., 

10 Mamm. Genome 5:652, 1994), itself a point mutation within the ornithine 
transcarbamylase gene {Otc) (Veres et al.. Science 237:415, 1987). The j/locus is also 
tightly linked to the murine Gatal, Tcfe3, and Wasp loci (Blair et al., Mamm. Genome 
5:652, 1994; Deny et al., Genomics 29:471, 1995). Similarities between scurfy and 
human Wiskott-Aldrich syndrome (WAS) have been noted (Lyon et al., Proc. Natl. 

15 Acad Sci. USA 57:2433, 1990), and the mouse Wasp gene has been proposed as a 
candidate for scurjy (Lyon et al., Proc. Natl. Acad Sci. USA 57:2433, 1990; Derry et 
al.. Genomics 29:471, 1995). Closer biological examination reveals significant 
differences between WAS and scurfy, however, and the two loci have been 
demonstrated to be non-allelic (Jeffery & Brunkow, unpublished data). Thus, prior to 

20 applicants' invention the identity of the scurfy gene remained to be determined. 

The present invention discloses methods and compositions usefol for 
diagnosing scurfy-related diseases, as well as methods for identifying compounds which 
can modulate the immune system, and further provides other related advantages. 



SUMMARY OF THE INVENTION 

The present invention relates generally to the discovery of novel genes 
which, when mutated, results in a profound lymphoproliferative disorder. In particular, 
a mutant mouse, designated 'Scurfy', was used to identify the gene responsible for this 
disorder through backcross analysis, physical mapping and large-scale DNA 
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sequencing. Analysis of the sequence of this gene indicated that it belongs to a family 
of related genes, all containing a winged-helix DNA binding domain. 

Thus, within one aspect of the invention isolated nucleic acid molecules 
are provided which encode FKH^ or Fkh'*, including mutant forms thereof. Within 
5 certain embodiments, Fkh^ of any type may be from a warm-blooded animal, such as a 
mouse or human. Within further embodiments, isolated nucleic acid molecules are 
provided wherein the nucleic acid molecule is selected from the group consisting of (a) 
a nucleic acid molecule that encodes an amino acid sequence comprismg SEQ ID Nos 
* 2, or, 4, (b) a nucleic acid molecule that hybridizes under stringent conditions to a 

10 nucleic acid molecule having the nucleotide sequence of SEQ ID Nos: 1, or, 3, or its 
complement, and (c) a nucleic acid molecule that encodes a functional fragment of the 
polypeptide encoded by either (a) or (b). Preferably, the nucleic acid molecule is not 
JM2. Within related aspects, vectors (including expression vectors), and recombinant 
host cells are also provided, as well as proteins which are encoded by the above-noted 

15 nucleic acid molecules. Further, fusion proteins are also provided which combine at 
least a portion of the above-described nucleic acid molecules with the coding region of 
another protein. Also provided are oligonucleotide fragments (including probes and 
primers) which are based upon the above sequence. Such firagments are at least 8, 10, 
12, 15, 20, or 25 nucleotides in length, and may extend up to 100, 200, 500, 1000, 1500, 

20 or, 2000 nucleotides in length. 

Within other aspects methods of using the above noted expression vector 
for producing a Fkh^ protein (of any type) are provided, comprising the general steps of 
(a) culturing recombinant host cells that comprise the expression vector and that 
produce Fkh^^protein, and (b) isolating protein from the cultured recombinant host cells, 

25 Also provided are antibodies and antibody fragments that specifically 

bind to Fkh^^ proteins. Representative examples of such antibodies include both 
polyclonal and monoclonal antibodies (whether obtained from a murine hybridoma, or 
derived into human form). Repesentative examples of antibody fragments include 
F(ab')2, F(ab)2, Fab', Fab, Fv, sFv, and minimal recognition units or complementarity 

30 determining regions. 
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Within yet other aspects, methods are provided for detecting the 
presence of a Fk}^ nucleic acid sequence in a biological sample from a subject, 
comprising the steps of (a) contacting a Fldf^ specific nucleic acid probe under 
hybridizing conditions with either (i) test nucleic acid molecules isolated from said 
5 biological sample, or (ii) nucleic acid molecules synthesized from RNA molecules, 
wherein said probe recognizes at least a portion of nucleotide sequence of claim 1, and 
(b) detecting the formation of hybrids of said nucleic acid probe and (i) or (ii). 

Within another related embodiment, methods are provided for detecting 
the presence of an Fkh^, or a mutant form thereof, in a biological sample, comprising 
10 the steps of: (a) contacting a biological sample with an anti-Fkh"*^ antibody or an 
antibody fragment, wherein said contacting is performed under conditions that allow the 
binding of said antibody or antibody fragment to said biological sample, and (b) 
detecting any of said bound antibody or bound antibody firagment. 

Within other aspects of the invention, methods are provided for 
15 introducing FW^ nucleic acid molecules to an animal, comprising the step of 
administering a FJdf nucleic acid molecule as described herein to an animal (e.g., a 
human, monkey, dog, cat, rat, or, mouse. Within one embodiment the nucleic acid 
molecule is contained within and expressed by a viral vector (e.g., a vector generated at 
least in part from a retrovirus, adenovirus, adeno-associated virus, herpes virus, or, 
20 alphavirus). Within another embodiment the nucleic acid molecule is expressed by, or 
contained within a plasmid vector. Such vectors may be administered either in vivo, or 
ex vivo (e.g., to hematopoietic cells such as T cells. 

Within other embodiments, transgenic non-human animals are provided 
wherein the cells of the ammal express a transgene that contains a sequence encoding 
25 Fkh"^ protein. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
various references are set forth herein which describe in more detail certain procedures 
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or compositions (e.^., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts a nucleotide sequence of mouse Fkh^cUHA (Seqeunce 
5 LD. No, 1); translation is predicted to initiate at position 259 and terminate at position 
1546. 

Figure 2 depicts the amino acid sequence of mouse Fkh^*^ (Sequence LD. 

No. 2). 

Figure 3 depicts a nucleotide sequence of 1735 bp corresponding to 
10 human FKHsf cDNA (Sequence LD, No. 3; including a 1293 bp coding region); 
translation is predicted to initiate at position 55 and terminate at position 1348. 

Figure 4 depicts the sequence of a 431 amino acid human FKH^ protein 
(Sequence LD. No. 4). 

Figure 5 diagranmiatically depicts a vector for generation of ¥Ktl^ 
15 transgenic mice. 

Figure 6 is a photograph which demonstrates that the FKH^ transgene 
corrects the defect in scurfy animals. 

Figure 7 is a graph which shows that FKH^^tg mice have reduced lymph 
node cells, as compared to normal cells. 
20 Figure 8 is a graph which shows that FKH^ transgenic mice respond 

poorly to in vitro stimulation. 

Figure 9 is a comparison of FKIP^ and JM2 cDNAs. 

Figure 10 compares homology in various regions of human FKH^^ and 

murine Fkh^. 
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DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

Prior to setting forth the Invention in detail, it may be helpful to an 
understanding thereof to set forth'definitions of certain terms and to list and to define 

5 the abbreviations that will be used hereinafter. 

' ^Scurfy" refers to an inherited disease in mice which exhibit a severe 
lymphoproliferative disorder {see, e.g., Lyon et al., Froc. Natl Acad ScL USA 57:2433, 
1990). The responsible gene (mutant forms of which are responsible for the disease) is 
shown in Sequence I.D. Nos. 1 and 3. 

10 "Molecule" should be understood to include proteins or peptides {e.g., 

antibodies, recombinant binding partners, peptides with a desired binding affinity), 
nucleic acids {e.g., DNA, RNA, chimeric nucleic acid molecules, and nucleic acid 
analogues such as PNA), and organic or inorganic compounds. 

'' Nucleic acid " or '' nucleic acid molecule " refers to any of 

15 deoxyribonucleic acid (DNA), ribonucleic acid (RNA), oligonucleotides, fragments 
generated by the polymerase chain reaction (PGR), and fragments generated by any of 
ligation, scission, endonuclease action, and exonuclease action. Nucleic acids can be 
composed of monomers that are naturally-occurring nucleotides (such as 
deoxyribonucleotides and ribonucleotides), or analogs of naturally-occurring 

20 nucleotides {e.g., a-enantiomeric forms of naturally-occurring nucleotides), or a 
combination of both. Modified nucleotides can have modifications in sugar moieties 
and/or in pyrimidine or purine base moieties. Sugar modifications include, for example, 
replacement of one or more hydroxyl groups mth halogens, alkyl groups, amines, and 
azido groups, or sugars can be funetionalized as ethers or esters. Moreover, the entire 

25 sugar moiety can be replaced with sterically and electronically similar structures, such 
as aza-sugars and carbocyclic sugar analogs. Examples of modifications in a base 
moiety include alkylated purines and pyrimidines, acylated purines or pyrimidines, or 
other well-known heterocyclic substitutes. Nucleic acid monomers can be linked by 
phosphodiester bonds or analogs of such linkages. Analogs of phosphodiester linkages 

30 include phosphorothioate, phosphorodithioate, phosphoroselenoate. 
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phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, phosphoramidate, and 
the like. The term "nucleic acid" also includes so-called "peptide nucleic acids," which 
comprise naturally-occurring or modified nucleic acid bases attached to a polyamide 
backbone. Nucleic acids can be either single stranded or double stranded. 
5 "Isolated nucleic acid molecule" is a nucleic acid* molecule that is not 

integrated in the genomic DNA of an oi^anism. For example, a DNA molecule that 
encodes a gene that has been separated from the genomic DNA of a eukaryotic cell is an 
isolated DNA molecule. Another example of an isolated nucleic acid molecule is a 
chemically-synthesizednucleic acid molecule that is not integrated in the genome of an 
10 organism. 

" Promoter " is a nucleotide sequence that directs the transcription of a 
structural gene. Typically, a promoter is located in the 5 ' region of a gene, proximal to the 
transcriptional start site of a structural gene. If a promoter is an inducible promoter, then 
the rate of transcription increases in response to an inducing agent. In contrast, the rate of 
15 transcription is not regulated by an inducing agent if the promoter is a constitutive 
promoter. 

" Vector" refers to an assembly which is capable of directing the 
expression of desired protein. The vector must include transcriptional promoter 
elements which are opembly linked to the genes of interest. The vector may be 

20 composed of either deoxyribonucleic acids ("DNA"), ribonucleic acids ("RNA"), or a 
combination of the two (e.^., a DNA-RNA chimeric). Optionally, the vector may 
include a polyadenylation sequence, one or more restriction sites, as well as one or more 
selectable markers such as neomycin phosphotransferase or hygromycin 
phosphotransferase. Additionally, depending on the host cell chosen and the vector 

25 employed, other genetic elements such as an origin of replication, additional nucleic 
acid restriction sites, enhancers, sequences conferring inducibility of transcription, and 
selectable markers, may also be incorporated into the vectors described herein. 

"Isolated " in the case of proteins or polypeptides, refers to molecules 
which are present in the substantial absence of other biological macromolecules, and 

30 appear nominally as a single band on SDS-PAGE gel with coomassie blue staining. 



8 



" Isolated " when referring to organic molecules means that the compounds are greater 
than 90% pure utilizing methods which are well known in the art (e.g., NMR, melting 
point). 

' 'Cloning vector' ' refers to nucleic acid molecules, such as a plasmid, 
5 cosmid, or bacteriophage, that has the capability of replicating autonomously in a host 
cell Cloning vectors typically contain one or a small number of restriction endonudease 
recognition sites at which foreign nucleotide sequences can be inserted in a determinable 
fashion without loss of an essential biological function of the vector, as well as nucleotide 
sequences encoding a marker gene that is suitable for use in the identification and 
10 selection of cells transformed with the cloning vector. Marker genes typically include 
genes that provide tetracycline resistance or ampicillin resistance. 

''Ex pression vector" " refers to a nucleic acid molecule encoding a gene that 
is expressed m a host cell. Typically, gene expression is placed under the control of a 
promoter, and optionally, under the control of at least one regulatory element. Such a 
1 5 gene is said to be "operably linked to" the promoter. Similarly, a regulatory element and a 
promoter are operably linked if the regulatory element modulates the activity of the 
promoter. 

" Recombinant host" refers to any prokaryotic or eukaryotic cell that 
contains either a cloning vector or expression vector. This term also includes those 
20 prokaryotic or eukaryotic cells that have been genetically engineered to contain the cloned 
gene(s) in the chromosome or genome of the host cell. 

In eukaryotes, RNA polymerase II catalyzes the transcription of a 
structural gene to produce mRNA. A nucleic acid molecule can be designed to contain an 
RNA polymerase II template in which the RNA transcript has a sequence that is 
25 complementary to that of a specific mRNA. The RNA transcript is termed an "anti-sense 
RNA" and a nucleic acid molecule that encodes the anti-sense RNA is termed an "anti- 
sense gene." Anti-sense RNA molecules are capable of binding to mRNA molecules, 
resulting in an inhibition of mRNA translation. 

An "anti-sense oligonucleoti de s pecific for FW-^ " or a 'TW^ anti-sense 
30 oligonucleotide" is an oligonucleotide having a sequence (a) capable of forming a stable 
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triplex with a portion of the gene, or (b) capable of forming a stable duplex with a 
portion of an mRNA transcript Similarly, an "anti-sense oligonucleotide specific for 
^'Fkh^' or a ^^Fkff^ anti-sense oligonucleotide'' is an oligonucleotide having a sequence 
(a) capable of forming a stable triplex with a portion of the F^fe/r^gene, or (b) capable of 
5 forming a stable duplex with a portion of an mRNA transcript of the FM*^gene, 

A ' ^ribozvme" is a nucleic acid molecule that contains a catalytic center. 
The term includes RNA enzymes, self-splicing RNAs, self-cleaving RNAs, and nucleic 
acid molecules that perform these catalytic functions. A nucleic acid molecule that 
encodes a ribozyme is termed a '''ribozyme gaie/' 

10 Abbreviations : YAC, yeast artificial chromosome; PCR, polymerase 

chain reaction; RT-PCR, PGR process in which RNA is first transcribed into DNA at 
the first step using reverse transcriptase (RT); cDNA, any DNA made by copying an 
RNA sequence into DNA form. As utilized herein "Fkh^^" refers to the gene product of 
the Fkh^ gene (irrespective of whether the gene is obtained fi*om humans, mammals, or 

15 any other warm-blooded animal). When capitalized "FKIT^" the gene product (and 
gene) should be xmderstood to be derived from humans. 

As noted above, the present invention relates generally to pharmaceutical 
products and methods and, more specifically, to methods and compositions usefiil for 
20 diagnosing scurfy-related diseases, as well as methods for identifying compounds which 
can modulate the inunune system. 

Thus, as discussed in more detail below this discovery has led to the 
development of assays which may be utilized to select molecules which can act as 
agonists, or alternatively, antagonists of the immune system. Furthermore, such assays 
25 may be utilized to identify other genes and gene products which are likewise active in 
modulating the inmiune system. 

Scurfy 

Briefly, the present inventions are based upon the unexpected discovery 
30 that a mutation in the gene which encodes Fkh'^ results in rare condition (scurfy) 
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characterized by a progressive lymphocytic infiltration of the lymph nodes, spleen, liver 
and skin resulting in gross morphological symptoms which include splenomegaly, 
hepatomegaly, greatiy enlarged lymph nodes, runting, exfoliative dermatitis, and 
thickened malformed ears (Godfi-ey et al., Amer, J. Pathol 138:137% 1991; Godfrey et 

5 ' al.. Proa Natl Acad ScL USA 55:5528, 1991), This new member of tiie winged-helix 
family represents a novel component of the immune system. 

Methods which were utilized to discover the gene responsible for scurfy 
are provided below in Example 1 . Methods for cloning the gene responsible for murine 
scurfy, as well as the human ortholog, are provided below in Examples 2 and 3. 

10 Methods for confirmation of gene identity and correlation witii gene function, as 
determined using transgenic mice, are also provided in the Examples. 

Also provided by the present invention are methods for determining the 
presence of Fkfr^ genes and gene products. Within one embodiment, such methods 
comprise tiie general steps of (a) contacting a Fkff^ specific nucleic acid probe under 

15 hybridizing conditions with either fi) test nucleic acid molecules isolated from the 
biological sample, or (ii) nucleic acid molecules synthesized from RNA molecules, 
wherein the probe recognizes at least a portion of an Fkh'^ nucleotide sequence, and (b) 
detecting the formation of hybrids of said nucleic acid probe and (i) or (ii). A variety of 
methods may be utilized in order to amplify a selected sequence, including, for 

20 example, RNA amplification (see Lizardi etal., Bio/Technology 5:1197-1202, 1988; 
Kramer et al. Nature iJP:401-402, 1989; Lomeh etal.. Clinical Chem, i5(9):1826- 
1831, 1989; U.S. Patent No. 4,786,600), and nucleic acid amplification utilizing 
Polymerase Chain Reaction ("PGR") (see U.S. Patent Nos. 4,683,195, 4,683,202, and 
4,800,159), reverse-transcriptase-PCR and CPT (see U.S. Patent Nos. 4,876,187, and 

25 5,011,769). 

Alternatively, antibodies may be utilized to detect the presence of Fkh'^ 
gene products. More specifically, within one embodiment methods are provided for 
detecting the presence of an Fkh"^ peptide, or a mutant form tiierepf, in a biological 
sample, comprising the steps of (a) contacting a biological sample with an anti- Fkh'*" 
30 antibody or an antibody fiagment, wherein said contacting is performed under 
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conditions that allow the binding of said antibody or antibody fragment to the biological 
sample, and (b) detecting any of the bound antibody or bound antibody fragment. 

Such methods may be accomplished in a wide variety of assay formats 
including, for example, Countercurrent Immuno-Electrophoresis (CIEP), 
5 Radioimmunoassays, Radioinummoprecipitations, Enzyme-Linked Immuno-Sorbent 
Assays (ELISA), Dot Blot assays. Inhibition or Competition assays, and sandwich 
assays {see U.S. Patent Nos. 4,376,110 and 4,486,530; see also Antibodies: A 
Laboratory Manual supra), 

10 Nucleic Acid Molecules, Proteins, and Methods of Producing Proteins 

Although various FKH^*^ or Fkh^^ proteins and nucleic acid molecules (or 
portions thereof) have been provided herein, it should be understood that within the 
context of the present invention, reference to one or more of these proteins should be 
understood to include proteins of a substantially similar activity. As used herein, 

15 proteins are deemed to be "substantially similar'' if: (a) they are encoded by a 
nucleotide sequence which is derived from the coding region of a gene which encodes 
the protein (including, for example, portions of the sequence or allelic variations of the 
sequence); (b)the nucleotide sequence is capable of hybridization to nucleotide 
sequences of the present invention imder moderate, high or very high stringency {see 

20 Sambrook et ah. Molecular Cloning: A Laboratory Manual^ 2nd ed.. Cold Spring 
Harbor Laboratory Press, NY, 1989), or has at least 50%, 60%, 70%, 75%, 80%, 90%, 
95%, or greater homology to the sequences disclosed herein, or, (c) the DNA sequences 
are degenerate as a result of the genetic code to the DNA sequences defined in (a) or 
(b). Further, the nucleic acid molecule disclosed herein includes both complementary 

25 and non-K:ompIementary sequences, provided the sequences otherwise meet the criteria 
set forth herein. Within the context of the present invention, high stringency means 
standard hybridization conditions (e.g., 5XSSPE, 0.5% SDS at 65X, or the equivalent). 
For purpose of hybridization, nucleic acid molecules which encode the amino-terminal 
domain, zinc finger domain, middle domain, or forkhead domain (see Example 10) may 

30 be utilized. 
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The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobicity plot fiinction of; for example, P/C Gene or Intelligenetics Suite 
(Intelligenetics, Mountain View/Califomia), or according to the methods described by 

5 Kyte and Doolitde (J. Mol Biol. 157: 1 05-132, 1 982). 

Proteins of the present invention may be prepared in die form of acidic 
or basic salts, or in neutral form. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
additions may be made to the amino acid or nucleic acid sequences, the net effect of 

1 0 which is to retain or further enhance or decrease the biological activity of the mutant or 
wild-type protein. Moreover, due to degeneracy in the genetic code, for example, there 
may be considerable variation in nucleotide sequences encodmg the same amino acid 
sequence. 

Other derivatives of the proteins disclosed herein include conjugates of 

15 the proteins along with odier proteins or polypeptides. This may be accomplished, for 
example, by the synthesis of N-terminal or C-terminal fusion proteins which may be 
added to facilitate purification or identification of proteins {see U.S. Patent No. 
4,851,341, see also, Hopp etal., Bio/Technology 6:1204, 1988.) Alternatively, fusion 
proteins {e.g., FKH or Fkh-Iuciferase or FKH or Fkh-GFP) may be constructed in order 

20 to assist in the identification, expression, and analysis of the protein. 

Proteins of the present invention may be constracted using a wide variety 
of techniques described herein. Further, mutations may be introduced at particular loci 
by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction 
sites enabling ligation to fragments of die native sequence. Following ligation, die 

25 resulting reconstructed sequence encodes a derivative having the desired amino acid 
insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific (or s^ment specific) 
mutagenesis procedures may be employed to provide an altered gene having particular 
codons altered according to the substimtion. deletion, or insertion required. Exemplary 

30 methods of making the alterations set forth above are disclosed by Walder et al. {Gene 
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42:133, 1986); Bauer etal. {Gem 57:73, 1985); Craik {BioTechniques, January 1985, 
12-19); Smith etal. {Genetic Engineering: Principles and Methods, Plenum Press, 
1981); and Sambrook et al. {supra). Deletion or truncation derivatives of proteins {e.g, 
a soluble extracellular portion) may also be constructed by utilizing convenient 
5 restriction endonuclease sites adjacent to the desired deletion. Subsequent to restriction, 
overhangs may be filled in, and the DNA religated. Exemplary methods of making the 
alterations set forth above are disclosed by Sambrook et al. {Molecular Cloning: A 
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 1 989). 

.Mutations which are made in the nucleic acid molecules of the present 

10 invention preferably preserve the reading frame of the coding sequences. Furthermore, 
the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRNA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
not necessary that the nature of the mutation per se be predetermined. For ex^ple, in 

15 order to select for optimum characteristics of mutants at a given site, random 
mutagenesis may be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 
by restriction sites enabling ligation to fragments of the native sequence. Following 

20 ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. Mutations may be introduced for 
purpose of preserving or increasing activity of the protein, or, for decreasing or 
disabling the protein {e,g, mutant Fkh), 

Nucleic acid molecules which encode proteins of the present invention 

25 may also be constructed utilizing techniques of PCR mutagenesis, chemical 
mutagenesis (Drinkwater and Klinedinst, PNAS 55:3402-3406, 1986), by forced 
nucleotide misincorporation (e.g., Liao and Wise Gene 55:107-1 11, 1990), or by use of 
randomly mutagenized oligonucleotides (Horwitz et al.. Genome 5:112-1 17, 1989). 

The present invention also provides for the manipulation and expression 

30 of the above described genes by culturing host cells containing a vector capable of 
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expressing the above-described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding the desired protein, which 
are operably linked to suitable transcriptional or translational regulatory elements. 
Suitable regulatory elements may be derived from a variety of sources, including 

5 bacterial, fungal, viral, manunalian, insect, or plant genes. Selection of appropriate 
regulatory elements is dependent on the host cell chosen, and may be readily 
accomplished by one of ordinary skill in the art. Examples of regulatory elements 
include: a transcriptional promoter and enhancer or RNA polymerase binding 
sequence, a transcriptional terminator, and a ribosomal binding sequence, including a 

1 0 translation initiation signal. 

Nucleic acid molecules that encode any of the proteins described above 
may be readily expressed by a wide variety of prokaryotic and eukaryotic host cells, 
including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 

15 known in the art (see, e.g., Itakura et al., U.S. Patent No. 4,704,362; Hinnen et al., Proc, 
Natl Acad. ScL USA 75:1929-1933, 1978; Murray etal,, U.S. Patent No. 4,801,542; 
Upshall et al., U.S. Patent No. 4,935,349; Hagen et aL, U.S. Patent No. 4,784,950: Axel 
etal., U.S. Patent No. 4,399,216; Goeddel etal., U.S. Patent No. 4,766,075; and 
Sambrook etal. Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring 

20 Harbor Laboratory Press, 1989; for plant cells see Czako and Marton, Plant Physiol. 
704:1067-1071, 1994; and Paszkowski et al., Biotech. 24:387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention include 
E. coll, B. subtilis. Salmonella typhimurium, and various species within the genera 
Pseudomonas, Streptomyces, and Staphylococcus, as well as many other bacterial 

25 species well known to one of ordinary skill in the art. Representative examples of 
bacterial host cells include DH5a (Stratagene, LaJolla, California). 

Bacterial expression vectors preferably comprise a promoter which 
functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the p-lactamase (penicillinase) 

30 and lactose promoter system {see Chang et al.. Nature 275:615, 1978), the T7 RNA 
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polymerase promoter (Studier et al., Meth. Enzymol 755:60-89, 1990), the lambda 
promoter (Elvin et al.. Gene 57:123-126, 1990), the trp promoter (Nichols and 
Yanofsky, Meth. in Enzymology 101:155, 1983) and the tac promoter (Russell etal., 
Gene 20:231, 1982). Representative selectable markers include various antibiotic 

5 resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 (see Bolivar etal.. Gene 2:95, 1977), the pUC plasmids pUC18, 
pUC19, pUCllS, pUC119 (see Messmg, Meth. in Enzymology 101:20-11, 1983 and 
Vieira and Messing, Gene ;P:259-268, 1982), and pNH8A, pNH16a, pNHlSa, and 

1 0 Bluescript M 1 3 (Stratagene, La Jolla, California). 

Yeast and fungi host cells suitable for carrying out the present invention 
include, among others, Saccharomyces pombe, Saccharomyces cerevisiae, the genera 
Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et al., 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and ftmgi include, 

15 among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
(Tumbull, Bio/Technology 7:169, 1989), YRp7 (Struhl etal., Proc. Natl. Acad. Set 
USA 7<5:1035-1039, 1978), YEpl3 (Broach et al.. Gene 5:121-133, 1979), pJDB249 and 
pJDB219 (Beggs, Mature 275:104-108, 1978) and derivatives thereof. 

Preferred promoters for use in yeast include promoters bom yeast 

20 glycolytic genes (Hitzeman etal., J. Biol. Chem. 255:12073-12080, 1980; Alber and 
Kawasaki, J. Mol. Appl. Genet. 7:419434, 1982) or alcohol dehydrogenase. genes 
(Young et al., in Genetic Engineering of Microorganisms for Chemicals., HoUaender 
et al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Enzymol. 707:192-201, 
1983). Examples of useful promoters for ftmgi vectors include those derived from 

25 Aspergillus nidulans glycolytic genes, such as the adh3 promoter (McKnight et al., 
EMBOJ. 4:2093-2099, 1985). The expression units may also include a transcriptional 
temiinator. An example of a suitable terminator is the adh3 terminator (McKnight 
et al., ibid., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 

30 selectable marker, which may be one of any number of genes that exhibit a dominant 
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phenotype for which a phenotypic assay exists to enable transformants to be selected. 
Preferred selectable maricers are those that complement host cell auxotrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include len2 
(Broach etal., ibid), ura3 (Botstein etal.. Gene 8:17, 1979), or his3 (Struhl etaL, 
5 ibid). Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
have been described, for instance, by Beggs (ibid,), Hinnen et aL (Proc, Natl, Acad, Sci. 
USA 75:1929-1933, 1978), Yelton etal. (Proc. Natl Acad Sci. USA <97: 1740- 1747, 
10 1984), and Russell (Nature 507:167-169, 1983). The genotype of the host cell may 
contain a genetic defect that is complemented by the selectable marker present on the 
expression vector. Choice of a particular host and selectable marker is well within the 
level of ordinary skill in the art. 

Protocols for the transformation of yeast are also well known to those of 
15 ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA 
75:1929, 1978) or by treatment with alkaline salts such as LiCl (see Itoh etal., J. 
Bacteriology 153:163. 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 
20 Viral vectors include those which comprise a promoter that directs the 

expression of an isolated nucleic acid molecule that encodes a desired protein as 
described above. A wide variety of promoters may be utilized within the context of the 
present invention, including for example, promoters such as MoMLV LTR, RSV LTR, 
Friend MuLV LTR, adenoviral promoter (Ohno etal.. Science 2(55:781-784, 1994), 
25 neomycin phosphotransferase promoter/enhancer, late parvovirus promoter (Koering 
etal.. Hum, Gene Therap, 5:457-463, 1994), Herpes TK promoter, SV40 promoter, 
metallothionein Ila gene enhancer/promoter, cytomegalovirus immediate early 
promoter, and the cytomegalovirus immediate late promoter. Within particularly 
preferred embodiments of the invention, the promoter is a tissue-specific promoter (see 
30 e,g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative examples of 
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suitable tissue specific promoters include neural specific enolase promoter, platelet 
derived growth factor beta promoter, human aljAal-chimaerin promoter, synapsin I 
promoter and synapsin II promoter. In addition to the above-noted promoters, other 
viral-specific promoters (e.g., retroviral promoters (including those noted above, as well 
5 as others such as HIV promoters), hepatitis, herpes (e.g., EBV), and bacterial, fimgai- or 
parasitic (e.g, malarial) -specific promoters may be xitilized in order to target a specific 
cell or tissue which is infected with a virus, bacteria, fimgus or parasite, 

Manmialian cells suitable for carrying out the present invention include, 
among others: PC12 (ATCC No. CRL1721), NlE-115 neuroblastoma, SK-N"BE(2)C 

10 neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NGl 08-15 murine 
cholinergic cell lines, or rat F2 dorsal root ganglion line, COS (e.g., ATCC No. CRL 
1650 or 1651), BHK (e.g, ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314)), CHO 
(ATCC No. CCL 61), HeLa (e.g, ATCC No. CCL 2), 293 (ATCC No. 1573; Graham 

15 et al., J. Gen Virol 55:59-72, 1977) and NS-1 cells. Other manmialian cell lines may 
be used within the present invention, including Rat Hep I (ATCC No. CRL 1600), Rat 
Hep II (ATCC No. CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC 
No. CCL 75.1), Human hepatoma (ATCC No. HTB-52), Hep G2 (ATCC No. HB 
8065), Mouse liver (ATCC No. CCL 29.1), NCTC 1469 (ATCC No. CCL 9.1), SP2/0- 

20 Agl4 (ATCC No. 1581), HIT-T15 (ATCC No. CRL 1777), Jurkat (ATCC No. Tib 152) 
and RINm 5AHT2B (Orskov and Nielson, F£55 229(1): 175- 178, 1988). 

Mammalian expression vectors for use in carrying out the present 
invention will include a promoter capable of directing the transcription of a cloned gene 
or cDNA. Preferred promoters include viral promoters and cellular promoters. Viral 

25 promoters include the cytomegalovirus immediate early promoter (Boshart et al.. Cell 
V/:52 1-530, 1985), cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani etal., Mol Cell Biol 7:854-864, 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 
metaIlothionein-1 promoter (Palmiter etal., U.S. Patent No. 4,579,821), a mouse 

30 promoter (Bergman et al., Proc. Natl Acad ScL USA 57:7041-7045, 1983; Grant et al.. 
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Nucl Acids Res, 75:5496, 1987) and a mouse Vh promoter (Loh et ai„ Cell JJ:85-93, 
1983). The choice of promoter will depend, at least in part, upon the level of expression 
desired or the recipient cell line to be transfected. 

Such expression vectors may also contain a set of RNA splice sites 
5 located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest Preferred RNA splice sites may be .obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest Suitable 
polyadenylation signals include the early or late polyadenylation signals from SV40 

10 (Kaufman and Sharp, ibid), the polyadenylation signal from the Adenovirus 5 ElB 
region and the human growth hormone gene terminator (DeNoto et al., Nuc, Acids Res, 
P:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 
the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 

15 the SV40 enhancer. Expression vectors may also include sequences encoding the 
adenovirus VA RNAs. Suitable expression vectois can be obtained from commercial 
sources {e.g,^ Stratagene, La JoUa, California). 

Vector constructs comprising cloned DNA sequences can be introduced 
into cultured mammalian cells by, for example, calcium phosphate-mediated 

20 transfection (Wigler etal.. Cell 14:125, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation 
(Neumann etal., EMBO 1 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds.). Current Protocols in Molecular Biology, John Wiley and Sons, 
Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA, a 

25 selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest. Preferred selectable markers for use in cultured mammalian cells include genes 
that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. Other 
selectable markers include fluorescent proteins such as GFP (green fluorescent protein) 
or BFP (blue fluorescent protein). The selectable mariner may be an amplifiable 

30 selectable marker. Preferred amplifiable selectable markers are the DHFR gene and the 
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neomycin resistance gene. Selectable markers are reviewed by Thilly (Mammalian Cell 
Technology, Butterworth Publishers, Stoneham, MA, which is incorporated herein by 
reference). 

Mammalian cells containing a suitable vector are allowed to grow for a 
5 period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
Drug selection is then applied to select for growth of cells that are expressing the 
selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 

10 expression levels. Cells expressing the introduced sequences are selected and screened 
- for production of the protein of interest in the desired fonn or at the desired level. Cells 
that satisfy these criteria can then be cloned and scaled up for production. Cells may 
also be selected for transfection based on their expression of GFP by sorting for GPP- 
positive cells using a flow cytometer. 

15 Protocols for the transfection of mammalian cells are well known to 

those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection, electroporation, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection {see Sambrook et al., supra). Naked vector constructs can 
also be taken up by muscular cells or other suitable cells subsequent to injection into the 

20 muscle of a mammal (or other animals). 

Numerous insect host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
been reviewed by Atkinson et al. {Pestic. ScL 25:215-224,1990). 

25 Numerous plant host cells known in the art can also be useful within the 

present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al. (J. Biosci, (Bangalore) II:47-5S, 1987). 

Within related aspects of the present invention, proteins of the present 

30 invention, may be expressed in a transgenic animal whose gerai cells and somatic cells 
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contain a gene which encodes the desired protein and which is operably linked to a 
promoter effective for the expression of the gene. Alternatively, m a similar manner 
transgenic animals may be prepared that lack the desired gene (e.g., "knockout" mice). 
Such transgenics may be prepared in a variety non-human animals, including mice, rats, 

5 rabbits, sheep, dogs, goats and pigs {see Hammer etal., Nature 575:680-683, 1985, 
Palmiter etal.. Science 222:809-814, 1983, Brinster etal., Proc, Natl Acad. ScL USA 
52:4438-4442, 1985, Pahniter and Brinster, Cell ^7:343-345, 1985, and U.S. Patent 
Nos. 5,175,383, 5,087,571, 4,736,866, 5,387,742, 5,347,075, 5,221,778, and 
5,175,384). Briefly, an expression vector, including a nucleic acid molecule to be 

10 expressed together with appropriately positioned expression control sequences, is 
introduced into pronuclei of fertilized eggs, for example, by microinjection. Integration 
of the injected DNA is detected by blot analysis of DNA from tissue samples. It is 
preferred that the introduced DNA be incorporated into the germ line of the animal so 
that it is passed on to the animal's progeny. Tissue-specific expression may be 

15 achieved through the use of a tissue-specific promoter, or through the use of an 
inducible promoter, such as the metallothionein gene promoter (Palmiter etal, 1983, 
/&?£/), which allows regulated expression of the transgene. 

Animals which produce mutant forms of Fkh'*^ other than the naturally 
occurring scurfy mutant Csf), or in genetic backgrounds different from the naturally 

20 occurring mutant, may be readily produced given the disclosure provided herein. 

Proteins can be isolated by, among other methods, culturing suitable host 
and vector systems to produce the recombinant translation products of the present 
mvention. Supematants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated by a variety of 

25 purification procedures in order to isolate the desired proteins. For example, the 
supernatant may be first concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as, for example, an anti-protein antibody bound to a suitable support. 

30 Alternatively, anion or cation exchange resins may be employed in order to purify the 
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protein. As a further alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

5 A protein is deemed to be "isolated" vwthin the context of the present 

invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
can be isolated such that no other (undesired) protein is detected pursuant to SDS- 
PAGE analysis followed by silver staining. 

10 

Assays for Selecting Molecules Which Modulate the immune system 

As noted above, the present invention provides methods for selecting 
and/or isolating molecules which are capable of modulating the immune system. 
Representative examples of suitable assays include the yeast and mammalian 2-hybrid 

15 systems (e.g., Dang et al, MoL Cell Biol I /:954, 1991; Fearon et al., Proc, Natl Acad. 
Set USA 5P:7958, 1992), DNA binding assays, antisense assays, traditional protein 
binding assays {e.g., utilizing ^-'I or time-resolved fluorescence), immunopreceipitation 
coupled with gel electrophoresis and direct protein sequencing, transcriptional analysis 
of Fkh^ regulated genes, cytokine production and proliferation assays. 

20 For example, within one embodiment proteins that directly interact with 

Fkh'^ can be detected by an assay such as a yeast 2-hybrid binding system (see, e.g., 
U.S. Patent Nos. 5,283,173, 5,468,614, 5,610,015, and 5,667,973). Briefly, in a two- 
hybrid system, a fusion of a DNA-binding domain- Fkh"^ protein (eg., GAL4- Fkh"^ 
fusion) is constructed and transfected into a cell containing a GAL4 binding site linked 

25 to a selectable marker gene. The whole Fkh'^ protein or subregions of Fkh'*' may be 
used. A library of cDNAs fused to the GAL4 activation domain is also constructed and 
co-transfected. When the cDNA in the cDNA-GAL4 activation domain fusion encodes 
a protein that interacts with Fkh'^ the selectable marker is expressed. Cells containing 
the cDNA are then grown, the construct isolated and characterized. Other assays may 

30 also be used to identify interacting proteins. Such assays include ELISA, Western 
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blotting, co-immimoprecipitations, in vitro transcription/translation analysis and the 
like. 

Within another aspect of the present invention, methods are provided for 
determining v^hether a selected molecule is capable of modulating the inunune system, 
5 comprising the steps of (a) exposing a selected candidate molecule to cells which 
express Fkh^^^ or, mutant Fkff^, and (b) determining whether the molecule modulates the 
, activity of Fkh^ , and thereby determining whether said molecule can modulate the 
immune system. Cells for such tests may derive from (a) normal lymphocytes, (b) cell 
lines engineered to overexpress the FKH^ (or Fkh^ protein (or mutant forms thereof) or 
10 (c) transgenic animals engineered to express said protem. Cells from such transgenic 
mice are characterized, in part, by a hyporesponsive state including diminished cell 
number and a decreased responsiveness to various stimuli {e,g„ Example 8)* 

It should be noted that while the methods recited herein may refer to the 
analysis of an individual test molecule, that the present invention should not be so 
15 limited. In particular, the selected molecule may be contained wdthin a mixture of 
compounds. Hence, the recited methods may further comprise the step of isolating the 
desired molecule. Furthermore, it should be understood that candidate molecules can be 
assessed for their ability to modulate the immune system by a number of parameters, 
including for example, T-cell proliferation, cytokine production, and the like. 

20 

Candidate Molecules 
A wide variety of molecules may be assayed for their ability to modulate 
the immune system. Representative examples which are discussed in more detail below 
include organic molecules, proteins or peptides, and nucleic acid molecules. 

25 

1 . Organic Molecules 

Nimierous organic molecules may be assayed for their ability to 
modulate the immune system. For example, vdthin one embodiment of the invention 
suitable organic molecules may be selected either from a chemical library, wherein 
30 chemicals are assayed individually, or from combinatorial chemical libraries where 
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multiple compounds are assayed at once, then deconvoluted to determine and isolate the 
most active compounds. 

Representative examples of such combinatorial chemical libraries 
include those described by Agrafiotis etal., "System and method of automatically 
5 generating chemical compoimds with desired properties," U.S. Patent No. 5,463,564; 
Armstrong, R.W., "Synthesis of combinatorial arrays of organic compounds through the 
use of multiple component combinatorial array syntheses/' WO 95/02566; Baldwin, J. J. 
etal., "Sulfonamide derivatives and their use," WO 95/24186; Baldwin, J J. etal., 
"Combinatorial dihydrobenzopyran library," WO 95/30642; Brenner, S., "New kit for 

10 preparing combinatorial libraries" WO 95/16918; Chenera, B. etal., "Preparation of 
library of resin-bound aromatic carbocyclic compounds," WO 95/16712; EUman, J.A., 
"Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid 
support," U.S. Patent No. 5,288,514; Felder, E. et al., "Novel combinatorial compound 
libraries," WO 95/16209: Lemer. R. et al., "Encoded combinatorial chemical libraries," 

15 WO 93/20242; Pavia, M.R. etal., "A method for preparing and selecting 
pharmaceutically useful non-peptide compounds from a structurally diverse universal 
library," WO 95/04277; Summerton, J.E. and D.D. Weller, "Morpholino-subunit 
combinatorial library and method," U.S. Patent No. 5,506,337; Holmes, C, "Methods 
for the Solid Phase Synthesis of Thiazolidinones, Metathiazanones, and Derivatives 

20 thereof," WO 96/00148; Phillips, G.B. and G.P. Wei, "Solid-phase Synthesis of 
Benzimidazoles," Tet Letters 37:4887-90, 1996; Ruhland, B. etal., "Solid-supported 
Combinatorial Synthesis of Structurally Diverse p-Lactams," 1 Amer. Chem, Soc, 
7/7:253-4, 1996; Look, G.C. etaL, "The Indentification of Cyclooxygenase-1 
Inhibitors from 4-Thiazolidinone Combinatorial Libraries," Bioorg and Med Chem, 

25 Letters 6:707-12, 1996. 



2. Proteins and Peptides 

A wide range of proteins and peptides make likewise be utilized as 
candidate molecules for modulating the immune system. 

30 
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a. Combinatorial Peptide Libraries 

Peptide molecules which modulate the immune system may be obtained 
through the screening of combinatorial peptide libraries. Such libraries may either be 
prepared by one of skill m the artt^ee e.g., U.S. Patent Nos. 4,528^66 and 4,359,535, 
5 and Patent Cooperation Treaty Publication Nos. WO 92/15679, WO 92/15677, WO 
90/07862, WO 90/02809, or purchased ftom commercially available sources ie.g.. New 
England Biolabs Ph.D.™ Phage Display Peptide Library Kit). 

b. Antibodies 

1 0 Antibodies which modulate the immune system may readily be prepared 

given the disclosure provided herein. Within the context of the present invention, 
antibodies are understood to include monoclonal antibodies, polyclonal antibodies, anti- 
idiotypic antibodies, antibody fragments (e.g.. Fab, and F(ab')2, Fy variable regions, or 
complementarity determining regions). As discussed above, antibodies are understood 

15 to be specific against Fkh^ if they bind with a Kg of greater than or equal to lO^M. 
preferably greater than of equal to lO^M. The affmity of a monoclonal antibody or 
binding partner, as well as inhibition of binding can be readily determined by one of 
ordinary skill in the art (see Scatchard, Ann. N.Y. Acad. Sci. 57:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 

20 ordinary skill in the art from a variety of warm-blooded animals such as horses, cows, 
various fowl, rabbits, mice, or rats. Typically, Fkh'^, or a unique peptide thereof .of 13- 
20 amino acids (preferably conjugated to keyhole limpet hemocyanin by cross-linking 
with glutaraldehyde) is utilized to immunize the animal through intraperitoneal, 
intramuscular, intraocular, or subcutaneous injections, in conjunction with an adjuvant 

25 such as Freund's complete or incomplete adjuvant. Following several booster 
immunizations, samples of serum are collected and tested for reactivity to the protein or 
peptide. Particularly preferred polyclonal antisera will give a signal on one of these 
assays that is at least three times greater than background. Once the titer of the animal 
has reached a plateau in terms of its reactivity to the protein, larger quantities of antisera 

30 may be readily obtained either by weekly bleedings, or by exsangumating the animal. 
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Monoclonal antibodies may also be readily generated using conventional 
techniques {see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 

5 McKeam, and Bechtol (eds,), 1 980, and Antibodies: A Laboratory Manual, Harlow and 
Lane (eds.). Cold Spring Harbor Laboratory Press, 1988, which are also incorporated 
herein by reference). 

Other techniques may also be utilized to construct monoclonal antibodies 
(see WiUiam D. Huse etal, "Generation of a Large Combmational Library of the 

10 Immunoglobulin Repertoire in Phage Lambda," Science 2^6:1275-1281, December 
1989; see also L. Sastry etal., "Cloning of the Immunological Repertoire in 
Escherichia coli for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library," Proc, Natl Acad Sci, USA 
56:5728-5732, August 1989; see also Michelle Aiting-Mees etal., ''Monoclonal 

15 Antibody Expression Libraries: A Rapid Alternative to Hybridomas," Strategies in 
Molecular Biology J:l-9. January 1990). 

A wide variety of assays may be utilized to determine the presence of 
antibodies which are reactive against the Fkh'^ (or the mutant forms of Fkh'^ described 
herein), including for example countercurrent immuno-electrophoresis, 

20 radioimmunoassays, radioimmunoprecipitations, enzyme-linked immuno-sorbent 
assays (ELISA), dot blot assays, western blots, immunoprecipitation. Inhibition or 
Competition Assays, and sandwich assays {see U.S. Patent Nos. 4,376,1 10 and 
4,486,530; see also Antibodies: A Laboratory Manual, Harlow and Lane (eds.). Cold 
Spring Harbor Laboratory Press, 1 988). 

25 Once suitable antibodies have been obtained, they may be isolated or 

purified by many techniques well known to those of ordinary skill in the art {see 
Antibodies: A Laboratory Manual, Harlow and Lane (eds.). Cold Spring Harbor 
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 
columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 

30 combination of these techniques. 
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Antibodies of the present invention may be utilized not only for 
modulating the immune system, but for diagnostic tests (e.g., to determine the presence 
of an FKH^ or Fkh^ protein or peptide), for therapeutic purpose, or for purification of 
proteins. 

5 

c. Mutant Fkh^ 

As described herein and below in the Examples, altered versions of Fkh'^^ 
may be utilized to inhibit the normal activity of Fkh^y thereby modulating the immune 
system (see generally, nucleic acid molecules and proteins above). 
10 Further mutant or altered forms of FKH^^ or Fkh^^ may be utilized for a 

wide variety of in vitro assays (e.g., in order to examine the affect of such proteins in 
various models), or, for the development of antibodies. 

15 3. Nucleic Acid Molecules 

Within other aspects of the invention, nucleic acid molecules are 
provided which are capable of modulating the immune system. For example, within 
one embodiment antisense oligonucleotide molecules are provided which specifically 
inhibit expression of FKIf^ or nucleic acid sequences, or, of mutant FKIf^or Fkh^ 

20 (see generally^ Hirashima et al. in Molecular Biology of RNA: New Perspectives (M. 
Inouye and B. S. Dudock, eds., 1987 Academic Press, San Diego, p. . 401); 
Oligonucleotides: Antisense Inhibitors of Gene Expression (J.S. Cohen, ed., 1989 
MacMillan Press, London); Stein and Cheng, Science 2^7:1004-1012, 1993; WO 
95/10607; U.S. Patent No. 5,359,051; WO 92/06693; and EP-A2-6 12844). Briefly, 

25 such molecules are constructed such that they are complementary to, and able to form 
Watson-Crick base pairs with, a region of transcribed Fkh^ mRNA sequence. The 
resultant double-stranded nucleic acid interferes with subsequent processing of the 
mRNA, thereby preventing protein synthesis. 

Within other aspects of the invention, ribozymes are provided which are 

30 capable of inhibiting FKff^ or Fkh'^, or mutant forms FKIT^ or Fldf^. As used herein. 
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"ribozymes" are intended to include RNA molecules that contain anti-sense sequences 
for specific recognition, and an RNA-cleaving enzymatic activity. The catalytic strand 
cleaves a specific site in a target RNA at greater than stoichiometric concentration. A 
wide variety of ribozymes may be* utilized within the context of the present invention, 
5 including for example, the hanamerhead ribozyme (for example, as described by Forster 
and Symons, Cell 48:21 1-220, 1987; Haseloff and Gerlach, Nature 52<?:596-600, 1988; 
Walbot and Bruening, Nattire 334:196, 1988; Haseloff and Gerlach, Nature 55^:585, 
1988); the hairpin ribozyme (for example, as described by Haselofif et al., U.S. Patent 
No. 5,254,678, issued October 19, 1993 and Hempel et aL, European Patent Publication 
10 No. 0 360 257, published March 26, 1990); and Tetrahvmena ribosomal RNA-based 
ribozymes (see Cech et al., U.S. Patent No. 4,987,071). Ribozymes of the present 
invention typically consist of RNA, but may also be composed of DNA, nucleic acid 
analogs (e.g., phosphorothioates), or chimerics thereof {e.g., DNA/'RNA/RNA). 

15 4. Labels 

FKH^ox Ffdf^, (as well as mutant forms thereof), or, any of the candidate 
molecules described above and below, may be labeled with a variety of compounds, 
including for example, fluorescent molecules, toxins, and radionuclides. Representative 
examples of fluorescent molecules include fluorescein, Phycobili proteins, such as 

20 phycoerythrin, rhodamine, Texas red and luciferase. Representative examples of toxins 
include ricin, abrin diphtheria toxin, cholera toxin, gelonin, pokeweed antiviral protein, 
tritin. Shigella toxin, and Pseudomonas exotoxin A. Representative examples of 
radionuclides include Cu-64, Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105, Pd-109, In- 
1 11, 1-123, 1-125, 1-131, Re-186, Re-188, Au-198, Au-199, Pb-203, At-21 1, Pb-212 and 

25 Bi-212. In addition, the antibodies described above may also be labeled or conjugated 
to one partner of a ligand binding pair. Representative examples include avidin-biotin, 
and riboflavin-riboflavin binding protein. 

Methods for conjugating or labeling the molecules described herein with 
the representative labels set forth above may be readily accomplished by one of 

30 ordinary skill in the art {see Trichothecene Antibody Conjugate. U.S. Patent No. 
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4,744,981; Antibody Conjugate, U.S. Patent No. 5,106,951; Fluorogenic Materials and 
Labeling Techniques, U.S. Patent No. 4,018,884; Metal Radionuclide Labeled Proteins 
for Diagnosis and Therapy, U.S. Patent No. 4,897,255; and Metal Radionuclide 
Chelating Compounds for Improved Chelation Kinetics, U.S. Patent No. 4,988,496; see 
5 also Inman, Methods In Emymology, Vol. 34, Affinity Techniques. Enzyme 
Purification: Part B, Jakoby and Wilchek (eds.). Academic Press, New York, p. 30, 
1974; see also Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical 
Applications," Biochem. 777:1-32, 1988). 

10 Pharmaceutical CoMPOSiTio>4s 

As noted above, the present invention also provides a variety of 
pharmaceutical compositions, comprising one of the above-described molecules which 
modulates the immune system, along with a pharmaceutically or physiologically 
acceptable carrier, excipients or diluents. Generally, such carriers should be nontoxic to 

15 recipients at the dosages and concentrations employed- Ordinarily, die preparation of 
such compositions entails combining the therapeutic agent with buffers, antioxidants 
such as ascorisic acid, low molecular weight (less than about 10 residues) polypeptides, 
proteins, amino acids, carbohydrates including glucose, sucrose or dextrins, chelating 
agents such as EDTA, glutathione and other stabilizers and excipients. Neutral buffered 

20 saline or saline mixed with nonspecific serum albumin are exemplary appropriate 
diluents. Preferably, the pharmaceutical composition (or, 'medicament') is provided in 
sterile, pyrogen-free form. 

In addition, the pharmaceutical compositions of the present invention 
may be prepared for administration by a variety of different routes. In addition, 

25 pharmaceutical compositions of the present invention may be placed within containers, 
along with packaging material which provides instiiictions regarding the use of such 
pharmaceutical compositions. Generally, such instructions will include a tangible 
expression describing the reagent concentration, as well as within certain embodiments, 
relative amounts of excipient ingredients or diluents {e.g., water, saline or PBS) which 

30 may be necessary to reconstitute the pharmaceutical composition. 
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Methods of Treatment 
The present invention also provides methods for modulating the immune 

system. Through use of the molecules described herein which modulate the immune 
5 * system, a wide variety of conditions in warm blooded animals may be readily treated or 

prevented. Examples of warm-blooded animals that may be treated include both 

vertebrates and mammals, including for example humans, horses, cows, pigs, sheep, 

dogs, cats, rats and mice. Such methods may have therapeutic value in patients with 

altered immune systems. This would include such patients as those undergoing 
10 chemotherapy of those with various immunodeficiency syndromes, as well as patients 

with a T cell mediated autoinmiune disease. Therapeutic value may also be recognized 

from utility as a vaccine adjuvant. 

Therapeutic molecules, depending on the type of molecule, may be 

administered via a variety of routes in a variety of tbrmulations. For example, within 
15 one embodiment organic molecules may be delivered by oral or nasal routes, or by 

injection (e.g., intramuscularly, intravenously, and the like). 

Within one aspect, methods are provided for modulating the immune 

system, comprising the step of introducing into lymphoid cells a vector which directs 

the expression of a molecule which modulates the immune system, and administering 
20 the vector containing cells to a warm-blooded animal. Within other related 

embodiments, the vector may be directly administered to a desired target location (e,g.. 

the bone marrow). 

A wide variety of vectors may be utilized for such therapeutic purposes, 
including both viral and non-viral vectors. Representative examples of suitable viral 

25 vectors include herpes viral vectors (e.g., U.S. Patent No. 5,288,641), adenoviral vectors 
(e.g., WO 94/26914, WO 93/9191 WO 99/20778; WO 99/20773; WO 99/20779; Kolls 
etal., PNAS Pi(l):215-219, 1994; Kass-Eisler etal., PNAS P0(24):l 1498-502, 1993; 
Guzman etal, Circulation 5«(6):2838-48, 1993; Guzman etal.. Or. Res, 75(6): 1202- 
1207, 1993; Zabner etal.. Cell 75(2): 207-2 16, 1993; Li etaL, Hum Gene Then 

30 ^(4):403-409, 1993; Caillaud etal, Eur 1 Neurosci, 5(10:1287-1291, 1993; Vincent 
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etai., Nat Genet. 5{2):130-134, 1993; JafFe etal.,iVa/'. Genet. /(5):372-378, 1992; and 
Levrero etal.. Gene 7 W(2): 195-202, 1991), adeno-associated viral vectors (WO 
95/13365; Flotte etal., PNAS P0(22):10613-10617, 1993), baculovirus vectors, 
parvovirus vectors (Koering et alT, Hum. Gene Therap. 5:457-463, 1994), pox virus 

5 vectors (Panicaii and Paoletti, PNAS 79:4927-4931, 1982; and Ozaki et al., Biochem. 
Biophys. Res. Comm. iP5(2):653-660, 1993), and retroviruses (e.g., EP 0,415,731; WO 
90/07936; WO 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patent 
No. 5,219,740; WO 93/11230; WO 93/10218). Viral vectors may likewise be 
constructed wMch contain a mixture of different elements (e.g., promoters, envelope 

10 sequences and the like) from different viruses, or non-viral sources. Within various 
embodiments, either the viral vector itself, or a viral particle which contains the viral 
vector may be utilized in the methods and compositions described below. 

Within other embodiments of the invention, nucleic acid molecules 
which encode a molecule which modulates die immune system (e.g.. a mutant Fkh'-^, or, 

15 an antisense or ribozyme molecule which cleaves Fkh") may be administered by a 
variety of alternative techniques, including for example administration of 
asialoosomucoid (ASOR) conjugated with poly-L-lysine DNA complexes (Cristano 
et al., PNAS 92122-92126, 1993), DNA linked to killed adenovirus (Curiel et al.. Hum. 
Gene Then i(2):147-154, 1992), cytofectin-mediated introduction (DMRIE-DOPE, 

20 Vical, California), direct DNA injection (Acsadi etal.. Nature 352:815-818, 1991); 
DNA ligand (Wu et al., J. of Biol. Chem. 26-/: 16985-1 6987, 1989); lipofection (Feigner 
etal., Proc. Natl. Acad Set USA <S-/:74 13-741 7, 1989); liposomes (Pickering etal., 
Circ. 5P(1):13-21, 1994; and Wang etal., PNAS 84m5l-n55, 1987); microprojectile 
bombardment (Williams etal.. PNAS 55:2726-2730, 1991); and direct delivery of 

25 nucleic acids which encode the protein itself either alone (Vile and Hart, Cancer Res. 
53: 3860-3864, 1993), or utilizing PEG-nucleic acid complexes. 

Representative examples of molecules which may be expressed by the 
vectors of present invention include ribozymes and antisense molecules, each of which 
are discussed in more detail above. 
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As will be evident to one of skill in the art, the amount and frequency of 
administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth. 
Typically, the compositions maybe administered by a variety of techniques, as noted 
above. 

The following examples are offered by way of illustration, and not by 
way of limitation. 
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EXAMPLES 
EXAMPLE 1 

5 Identification of the Gene Responsibile for the Scurfy Mutant 

A. Cloning of a Scurfy gene 

The original scurfy mutation arose spontaneously in the partially inbred 
MR stock at Oak Ridge National Laboratory (ORNL) in 1949. Backcross analysis was 

10 used to fine map the peri-centromeric region of the X chromosome containing the 
mouse Scurfy mutation. A physical map covering the same region was generated 
concurrently through the isolation of overlapping yeast and bacterial artificial 
chromosomes (YACs and BACs), Once the candidate region was narrowed down to 
-500 kilobase pairs (kb), large-scale DNA sequencing was performed on 4 overlapping 

15 BAG clones. All the transcription units in this 500 kb region were identified through a 
combination of sequence database searching and the application of computer exon 
prediction programs. Candidate genes were then screened for Scurfy-specific mutations 
by comparing the sequences of cDNAs obtained by the Reverse Transcription- 
Polymerase Chain Reaction (RT-PCR) procedure from normal and Scurfy-derived RNA 

20 samples. In one gene, referred to here as FAA^, a two base pair (bp) insertion was found 
in the coding region of the Sciufy cDNA, relative to the normal cDNA. The insertion 
was confirmed by comparing the DNA sequences of PCR products derived from the 
genomic DNA of several mouse strains, including the Scurfy mutant. Again, the two bp 
insertion was found only in the Scurfy sample, establishing this as the probable cause of 

25 the Scurfy defect. 

The mouse Fldf^ gene is contained within the BAC clone 8C22, and has 
been completely sequenced. It spans -14 kb and contains 11 coding exons. The 
locations of exon breaks were initially identified by computer analysis of the genomic 
DNA sequence, using the GenScan exon prediction program; exon locations were then 
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confirmed by direct comparison of cDNA sequences derived from nonnai mouse tissues 
to the genomic sequence. 

The length of cDNA obtained is 2160 bp; the coding region spans 1287 
bp of that, encoding a protein of 429 amino acids. Figure I shows the nucleotide 
5 sequence of the mouse Fkh^cDNA; translation is predicted to initiate at position 259 
and terminate at position 1546. Figure 2 shows the amino acid sequence of mouse 
Fkh^^ 

b. Generation of Fkh^ transgenic mice. 

10 The identity of the Fkh^ gene as the true cause of the Scurfy phenotype 

was confirmed in transgenic mice. Briefly, a 30 kb fragment of the normal genomic 
DNA, including the -7 kb coding region of the Fkh'^ gene, as well as --20 kb of 
upstream flanking sequences and -4 kb of downstream sequences (Figure 5) was 
microinjected into normal mouse one-cell embryos. Five individual founder animals 

15 were generated, each with distinct integrations, and a male animal from each transgenic 
line was crossed to a female sf carriers. Male offspring carrying both the transgene 
(normal Ffdf^) and sf mutation (mutant F/df^ were analyzed. 

Analysis consisted of examination of animals for runting, scaly skin, fur 
abnormalities and other hallmarks of the scurfy phenotype. In addition, lymphoid 

20 tissues (thymus, spleen and nodes) were harvested and their size and cell number 
examined and compared to both normal animals as well as scurfy mice. For all five 
transgenic lines, male sf progeny that carried the transgene were normal in size and 
weight and appeared healthy in all respects. Lymph node size in these transgenic mice 
was similar to (or smaller than) that of normal animals (Figure 6) and there was no sign 

25 of activated T cells. These parameters are extremely different from sf mice and indicate 
that addition of the normal Fkh'-^ gene can overcome the defect found in scurfy mice, 
thus confirming that the mutation in the FW^gtnt is the cause of Scurfy disease. 
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EXAMPLE 2 
Generation of Fki^cDNA 

A complementary DNA (cDNA) encoding the complete mouse Fkh^ 
5 protein may be obtained by a reverse-transcriptase polymerase chain reaction (RT-PCR) 
procedure. More specifically, first-strand cDNA is generated by oligo dT priming 5 ug 
of total RNA firom a suitable source (eg., mouse spleen) and extending with reverse 
transcriptase under standard conditions (eg., Gibco/BRL Superscript kit). An aliquot of 
the first-strand cDNA is then subjected to 35 cycles of PGR (94°C for 30 sec, 63°C for 
10 30 sec, 72°C for 2 min) in the presence of the forward and reverse primers (Forward 
primer: GCAGATCTCC TGACTCTGCC TTC; Reverse primer: GCAGATCTGA 
CAAGCTGTGT CTG) (0.2 mM final concentration), 60 mM Tris-HCl, 15 mM 
ammonium sulfate, 1.5 mM magnesium chloride, 0.2 mM each dNTP and 1 unit of Taq 
polymerase. 

15 EXAMPLE 3 

Generation of the human ortholog to Murine Fki^ 

A human FKIf^ cDNA encoding the complete FKH^ protein may be 
obtained by essentially the same procedure as described in Example 2. In particular, 

20 starting with total spleen RNA, and utilizing the following oligonucleotide primers 
(Forward primer: AGCCTGCCCT TGGACAAGGA C; Reverse primer: 
GCAAGACAGT GGAAACCTCA C), and the same PGR conditions outlined above, 
except with a GO'^C annealing temperature. 

Figure 4 shows the nucleotide sequence of the 1869 bp cDNA obtained 

25 to date (including an 1293 bp coding region); translation is predicted to initiate at 
position 189 and terminate at position 1482. Figure 4 shows the sequence of the 431 
amino acid human FKH^^ protein. Gomparison of the predicted coding region of the 
human gene to the mouse cDNA sequence reveals nearly identical exon structure and 
86,1% amino acid sequence identity across the entire protein. 

30 
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EXAMPLE 4 
Methods for detecting Scurfy Mutations 

As noted above, *the Scurfy mutation was originally discovered by 
5 direcdy sequencing cDNAs derived by RT-PCR of sf and normal mouse RNA samples, 
and confirmed by sequencing the same region from genomic DNA, The nature of the 
mutation {i.e., a 2 bp insertion) lends itself to a number of different mutation detection 
assays. The first is based on differential hybridization of oligonucleotide probes. Such 
a hybridization-based assay could allow quantitative analysis of allele-specific 
10 expression. 

As an example, a 360 bp DNA Segment is amplified from 1^ strand 
cDNA using the following oligos: 

DM05985 (forward): CTACCCACTGCTGGCA.AATG (ntd. 825-844 of Figure. 1) 

15 

DM06724 (reverse): GAAGGAACTATTGCCATGGCTTC (ntd 1221-1199) 

20 The PGR products are run on an 1.8% agarose gel, transferred to nylon 

membrane and probed with end-labeled oligonucleotides that are complementary-to the 
region corresponding to the site of the Scurfy-specific 2 bp insertion. Two separate 
hybridization reactions are performed to detect the normal and Scurfy PGR products, 
using the oligonucleotides below (the site of the 2 bp insertion is shown in bold): 

25 

Normal: ATGCAGCAAGAGCTCTTGTCCATTGAGG 
DM07439 

Scurfy:GCAGCAAGAGCTCTTTTGTCCATTGAGG 
30 DM06919 
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The Scurfy mutation can also be detected by a cold Single-Strand 
Conformation Polymorphism (cSSCP) assay. In this assay, the same PGR products 
described above are run on 20% acrylamide (TBE) gels after strand denaturation. The 
5 Scurfy insertion causes a shift in strand mobility, relative to the normal sequence, and * 
the separate strands are detected after staining with ethidium bromide. 

EXAMPLES 

F^/f^GENE EXPRESSION 

10 

Semi-quantitative RT-PCR has been used to analyze the pattern of 
mouse and human Fkh^ gem expression in a wide variety of tissues and cell lines. 
Levels of expression are normalized to the ubiquitously expressed DAD-1 gene. In 
short, the Fkh"^ gene is expressed, albeit at very low levels, in nearly every tissue 
15 examined thus far, including thymus, spleen, sorted CD4+ and CD4-CD8- T- 
lymphocytes, as well as kidney, brain, and various mouse and human T-celi lines and 
human tumors. Absence of expression, however, was noted in freshly sorted mouse B- 
cells. 

As expected, no differences in level of expression were observed in 
20 normal vs. Scurfy tissues in the RT-PCR assays. 

EXAMPLE 6 
In vitro Expression of Fkh^ 

25 Full-length mouse and human Fkh^ cDNAs, as well as various sub- 

regions of the cDNAs are cloned into vectors which allow expression in mammalian 
cells (such as the human Jurkat T-cell line), E. coli or yeast. The E, coU or yeast 
systems can be used for production of protein for the purpose of raising Fkh^ -specific 
antibodies (see below). 

30 
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EXAMPLE? 
Generation of anti-Ffch^*' antibodies 

Protein expressed from vectors described in example 6 are used to 
immunize appropriate animals for the production of FKH^ specific antibodies. Either 
fiill length or truncated proteins can be used for this purpose. Protein can be obtained, 
for example, from bacteria such as £ coli, insect cells or mammalian cells. Animal 
species can include mouse, rabbit, guinea pig, chicken or other. Rabbit antisera specific 
for FKH"^ has been generated, as determined by biochemical characterization 
(immunoprecipitation and western blotting). 

EXAMPLE 8 
Assay for Function of an FKH'^ gene 

Since loss of fimction of the FKH^ protein results in the phenotype 
observed in scurfy animals (wasting, hyperactive immune responsiveness and death), 
assays are described for assessing excessive expression of the FKH'*^ protein. 
Transgenic animals (described in Example 1) are examined for their state of immime 
competence, using several different parameters. Animals are examined for the number 
of lymphoid cells present in lymph nodes and thymus (Figure 7) as well as the 
responsiveness of T cells to in vitro stimulation (Figure 8). 

Scurfy mutant animals have roughly twice as many cells in their lymph 
nodes as normal animals, whereas mice which express excess levels of the normal 
FKH"^ protein contain roughly one-third as many cells (Figure 7). Further, the number 
of thymocytes is normal (Figxire 7) as is their cell surface phenotype as assessed by flow 
cytometry using standard antisera (not shown), indicating that there is no developmental 
defect associated with excess FKH^ protein. 

Normal, scurfy and transgenic animals are further examined for their 
proliferative responses to T cell stimulation. CD44- T cells are reacted with antibodies 
to CD3 and CD28 and their proliferative response measured using radioactive 
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thymidine incorporation* Whereas only scurfy cells divide in the absence of 
stimulation, normal cells respond well following stimulation, FKH^ transgenic cells 
also respond to stimulation, however the response is significantly less than that of 
normal cells (Figure 8). This indicates that CD4+ T cells that express excess FKH'^ 
have a diminished capacity to respond to stimxili. 
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EXAMPLE 9 
Human FKH"^ cDNA sequence is related to JM2 

A modified version of the human FKH^ cDNA sequence exists in the 
5 GenBank public sequence database. This sequence is called JM2 (GenBank acc. # 
AJ0O5891), and is the result of the application of exon prediction programs to the 
genomic sequence containing the FKH^ gene (Strom, T.M. et aL, unpublished - see 
GenBank acc. # AJ005891). In contrast, the structure of the FKH^ cDNA was 
determined experimentally. The GAP program of the Genetics Computer Group (GCG; 

10 Madison, USA) Wisconsin sequence analysis package was used to compare the two 
sequences, and the differences are illustrated in Figure 9. The 5' ends of the two 
sequences differ in their location within the context of the genomic DNA sequence, the 
second coding exon of FKIT^ is omitted from JM2, and the last intron of the FXZT^gene 
is unspliced in the JM2 sequence. These differences result in a JM2 protein with a 

15 shorter amino-terminal domain, relative to FKH^^, and a large insertion within the 
forkhead domain (see below) at the carboxy-terminus. 

EXAMPLE 10 
The FKff^ protein is conserved across species 

20 

The FKH'^ protein can be divided into sub-regions, based on sequence 
motifs that may indicate functional domains. The two principal motifs in FKH^^are the 
single zinc finger (ZNF) of the C2H2 class in the middle portion of the protein, and the 
forkhead, or winged-helix domain at the extreme carboxy-terminus of the protein. For 
25 the purposes of characterizing the degree of homology between FKH^ and other 
proteins, we have split the protein up into four regions: 
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Amino-terminal domain: residues 1 - 1 97 of Figure 2 

residues 1-198 of Figure 4 
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Zinc finger domain: 



residues 198-221 of Figure 2 
residues 199-222 of Figure 4 



Middle domain: 



residues 222-336 of Figure 2 
residues 223-336 of Figure 4 



5 



Forkhead domain: 



residues 337-429 of Figure 2 
residues 337-431 of Figure 4 
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Using the Multiple Sequence Alignment program from the DNAStar 



sequence analysis package, the Lipman-Pearson algorithm was employed to determine 
the degree of similarity between the human FKH'^and mouse Fkh'^ proteins across these 
four domains. The results are shown in Figure 10. The similarity indices ranged from 
82.8% to 96.4%, indicating that this protein is very highly conserved across species. 



20 other novel genes (and proteins) which fall into the same sub-class of forkhead- 
containing molecules. The FKH"^ protein is unique in its having a single zinc finger 
domain amino-terminal to the forkhead domain as well as in the extreme carboxy- 
terminal position of the forkhead domain. A degenerate PGR approach may be taken to 
isolate novel genes containing a zinc finger sequence upstream of a forkhead domain. 

25 By way of example, the following degenerate primers were synthesized (positions of 
degeneracy are indicated by parentheses, and "I" indicates the nucleoside inosine): 



EXAMPLE 11 



Identification of novel Fkh"'-] 



RELATED GENES 



The unique features of the FKH^ gene sequence may be used to identify 



Forward primer: CA(TC)GGIGA(GA)TG(CT)AA(GA)TGG 

Reverse primer: (GA)AACCA(GA)TT(AG)TA(AGT)AT(CT)TC(GA)TT 

30 
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The forward primer corresponds to a region within the zinc finger 
sequence and the reverse primer corresponds to a region in the middle of the forkhead 
domain. These primers were used to amplify first-strand cDNA produced as in 
Example 2 firom a variety of human tissues {including liver, spleen, brain, lung, kidney, 
5 etc.). The following PGR conditions were used: forward and reverse primers at 0.2 
mM final concentration, 60 mM Tris-HCl, 15 mM ammonium sulfate, 1.5 mM 
magnesium chloride, 0,2 mM each dNTP and 1 unit of Taq polymerase, subjected to 35 
cycles (94X for 30 sec, 50°C for 30 sec, ITC for 2 min), PGR products were 
visualized on a 1.8% agarose gel (run in Ix TAE) and sub-cloned into the TA cloning 
10 vector (Invitrogen, Garlsbad, GA); individual clones were sequenced and used for 
fiirther characterization of full-length cDNAs. 

Alternatively, the unique regions of the FKIP^ gene (i.e., the "Amino- 
terminal" and "Middle" domains) may be used to screen cDNA libraries by 
hybridization. cDNA libraries, derived from a variety of human and/or mouse tissues, 
15 and propagated in lambda phage vectors (eg., lambda gtl I) were plated on agarose, 
plaques were transferred to nylon membranes and probed with fi:agments derived from 
the unique regions of the FKIT^ gene. Under high stringency conditions (eg., 
hybridization in 5x SSPE, 5x Denhardt's solution, 0.5% SDS at 65°G, washed in 0.1 x 
SSPE, 0.1% SDS at 65G) only very closely related sequences are expected to hybridize 
20 (i.e,, 90-100% homologous). Under lower stringency, such as hybridization and 
washing at 45*'-55''G in the same buffer as above, genes that are related to FKIf^ (65- 
90% homologous) may be identified. Based on results obtained from searching public 
databases with the unique sequences of FKH'*^ any genes identified through low- to 
mid-stringency hybridization experiments are expected to represent novel members of a 
25 "FKH^ family". 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating fi-om the spirit and scope of the 
30 invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 



We claim: 



L 



An isolated nucleic acid molecule which encodes Fkh"^. 



2. The isolated nucleic acid molecule according to claim I, wherein said 
Fkh^^ is murine Fkh^ 

3. The isolated nucleic acid molecule according to claim 1, wherein said 
Fkh^ is human FKH^ 



nucleic acid molecule is selected from the group consisting of (a) a nucleic acid molecule that 
encodes an amino acid sequence comprising SEQ ID NOS 2, or, 4, (b) a nucleic acid 
molecule that hybridizes imder stringent conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOS: 1, or, 3, or its complement, and (c) a nucleic acid 
molecule that encodes a functional fragment of the polypeptide encoded by either (a) or (b). 

5. The isolated nucleic acid molecule of claim 1, wherein said nucleic 
acid molecule encodes the amino acid sequence of SEQ ID N0:2. 

6. The isolated nucleic acid molecule of claim 5, wherein said nucleic 
acid molecule comprises the nucleotide sequence of SEQ ID NO: 1 . 



4. 



The isolated nucleic acid molecule acccyding to claim I, wherein said 



7. 



A vector comprising the isolated nucleic acid molecule 



of claim 1 . 



8. 



The vector according to claim 7 wherein said vector is a 



viral vector. 
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9- The vector according to claim 8 wherein said viral 

vector is generated from a virus selected from the group consisting of retrovirus, adenovirus, 
herpes virus, adeno-associated virus and alphavirus. 



10. An expression vector comprising the isolated nucleic acid molecule of 
claim 1 and a promoter, wherein said promoter is operably linked with said nucleic acid 
molecule. 



1 L A recombinant host cell comprising the expression vector of claim 1 0. 

12. A method of using the expression vector of claim 10 to prepare Fkh^ 
protein, said method comprising the steps of: 

(a) culturing recombinant host cells that comprise said expression 
vector and that produce said protein, and 

(b) isolating said protein from said cultured recombinant host cells, 

13. An isolated polypeptide comprising an amino acid sequence encoded 
by the nucleic acid molecule according to anyone of claims I to 6. 

14. An antibody or antibody fragment that binds specifically with the 
polypeptide encoded by the nucleic acid molecule according to claim 1 . 

15. The antibody of claim 13, wherein said antibody is selected from the 
group consisting of: 

(a) polyclonal antibody, 

(b) murine monoclonal antibody, 

(c) humanized antibody derived from (b), and 

(d) human monoclonal antibody. 
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16. The antibody fragment of claim 14, wherein said antibody fragment is 
selected from the group consisting of F(ab')2> F(ab)2, Fab', Fab, Fv, sFv, and minimal 
recognition unit. 

1 7. A fusion protein comprising the polypeptide according to claim 13. 

1 8. A method of detecting the presence of a Fkl^ nucleic acid sequence in 
a biological sample from a subject, comprising the steps of : 

(a) contacting a FW^ specific nucleic acid probe under hybridizing 
conditions with either (i) test nucleic acid molecules isolated from said biological 
sample, or (ii) nucleic acid molecules synthesized from RNA molecules, wherein said 
probe recognizes at least a portion of nucleotide sequence of claim 1, and 

(b) detecting the formation of hybrids of said nucleic acid probe and (i) or 

(ii). 

19. The method according to claim 18, wherein said test nucleic acid 
molecule is obtained by RT-PCR. 

20 A method of detecting the presence of an Fkh'^, or a mutant form 
thereof, in a biological sample, comprising the steps of: 

(a) contacting said biological sample with an anti- Fkh'^ antibody or an 
antibody fragment, wherein said contacting is performed under conditions that allow 
the binding of said antibody or antibody fragment to said biological sample, and 

(b) detecting any of said bound antibody or bound antibody fragment 

21. The method of claim 20, wherein said antibody or said antibody 
fragment is selected from the group consisting of: 

(a) polyclonal antibody, 

(b) a murine monoclonal antibody, 

(c) a humanized antibody derived from (b). 
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(d) a human monoclonal antibody, and 

(e) an antibody fragment derived from (b), (c) or (d). 

22. The method of claim 20, wherein said antibody fragment is selected 
from the group consisting of F(ab')2, F(ab)2, Fab', Fab, Fv, sFv, and minimal recognition unit 

23. The method of claim 20, wherein said antibody or said antibody 
fragment fiirther comprises a detectable label selected from the group consisting of 
radioisotope, fluorescent label, chemiluminescent label, enzyme label, bioluminescent label, 
and colloidal gold. 

24. An isolated oligonucleotide which is capable of hybridizing to the 
nucleic acid molecule according to claim 1. 

25. The oligonucleotide according to claim 23, fiirther 
comprising a detectable label. 

26. A method of introducing a Fkht^ nucleic acid molecule 
to an animal, comprising the step of administering a FA/?'^ nucleic acid molecule according to 
claim 1 to an animal. 

27. The method according to claim 26 wherein said nucleic 
acid molecule is expressed by a viral vector. 

28. The method according to claim 26 wherein said nucleic 
acid molecule is expressed by a plasmid vector. 

29. The method according to claim 26 wherein said nucleic 
acid molecule is administered to an animal in vivo. 

30. The method according to claim 26 wherein said nucleic 
acid molecule is administered to cells ex vivo, and said cells are then administered to said 
animal. 

3 1 - The method according to claim 26 wherein said cells are 

hematopoietic cells. 
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32. The method according to claim 26 wherein said 
hematopoietic cells are T cells. 

33. The method according to claim 26 wherein said animal 
is selected from the group consisting of humans, monkeys, dogs, cats, rats and mice. 

34. A transgenic non-human animal whose cells express a transgene that 
contains a sequence encoding Fkh^ protein. 
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IDENTIFICATION OF THE GENE CAUSING THE MOUSE SCURFY 
PHENOTYPE AND ITS HUMAN ORTHOLOG 



ABSTRACT OF THE DISCLOSURE 



Isolated nucleic acid molecules are provided which encode Fkh^, as well as 
mutant forms thereof. Also provided are expression vectors suitable for expressing such 
nucleic acid molecules, and host cells containing such expression vectors. Utilizing assays 
based upon the nucleic acid sequences disclosed herein (as well as mutant forms thereof), 
numerous molecules may be identified which modulate the immune system. 



Express Mail Label Na.: 

MOUSE Fkh'^oDNA SEQUENCE 



1 GCTGATCCCC CTCTAGCAGT CCACTTCACC AAGGTGAGCG AGTGTCCCTG 

51 CTCTCCCCCA CCAGACACAG CTCTGCTGGC GAAAGTGGCA GAGAGGTATT 

101 GAGGGTGGGT GTCAGGAGCCTCACCAGTACA GCTGGAAACA CCCAGCCACT 

151 CCAGCTCCCG GCAACTTCTC CTGACTCTGC CTTCAGACGA GACTTGGAAG 

201 ACAGTCACAT CTCAGCAGCT CCTCTGCCGT TATCCAGCCT GCCTCTGACA 

251 AGAACCCAAT GCCCAACCCT AGGCCAGCCA AGCCTATGGC TCCTTCCTTG 

301 GCCCTTGGCC CATCCCCAGG AGTCTTGCCA AGCTGGAAGA CTGCACCCAA 

351 GGGCTCAGAA CTTCTAGGGA CCAGGGGCTC TGGGGGACCC TTCCAAGGTC 

401 GGGACCTGCG AAGTGGGGCC CACACCTCTT CTTCCTTGAA CCCCCTGCCA 

451 CCATCCCAGC TGCAGCTGCC TACAGTGCCC CTAGTCATGG TGGCACCGTC 

501 TGGGGCCCGA CTAGGTCCCT CACCCCACCT ACAGGCCCTT CTCCAGGACA 

551 GACCACACTT CATGCATCAG CTCTCCACTG TGGATGCCCA TGCCCAGACC 

601 CCTGTGCTCC AAGTGCGTCC ACTGGAOVAC CCAGCCATGA TCAGCCTCCC 

651 ACCACCTTCT GCTGCCACTG GGGTCTTCTC CCTCAAGGCC CGGCCTGGCC 

701 TGCCACCTGG GATa\ATGTG GCCAGTCTGG AATGGGTGTC CAGGGAGCCA 

751 GCTCTACTCT GCACCTTCCC ACGCTCGGGT ACACCCAGGA AAGACAGCAA 

801 CCTTTTGGCT GCACCCCAAG GATCCTACCC ACTGCTGGCA AATGGAGTCT 

851 GCAAGTGGCC TGGTTGTGAG AAGGTCTTCG AGGAGCCAGA AGAGTTTCTC 

901 AAGCACTGCC AAGCAGATCA TCTCCTGGAT GAGAAAGGCA AGGCCCAGTG 

951 CCTCCTCCAG AGAGAAGTGG TGCAGTCTGT GGAGCAGCAG CTGGAGCTGG 

1001 AAAAGGAGAA GCTGGGAGCT ATGCAGGCCC ACCTGGCTGG GAa.GATGGCG 

1051 CTGGCCAAGG CTCCATCTGT GGCCTCAATG GACAAGAGCT CTTGCTGCAT 

1101 CGTAGCC^CC AGTACTCAGG GCAGTGTGCT CCCGGCCTGG TCTGCTCCTC 

1151 GGGAGGCTCC AGACGGCGGC CTGTTTGCAG TGCGGAGGCA CCTCTGGGGA 

1201 AGCCATGGCA ATAGTTCCTT CCCAGAGTTC TTCCACAACA TGGACTACTT 

1251 CAAGTACCAC AATATGCGAC CCCCTTTCAC CTATGCCACC CTTATCCGAT 

1301 GGGCCATCCT GGAAGCCCCG GAGAGGCAGA GGAOVCTCAA TGAAATCTAC 

1351 aATTGGTTTA CTCGCATGTT CGCCTACTTC AGAAACCACC CCGCCACCTG 

1401 GAAGAATGCC ATCCGCCACA ACCTGAGCCT GCACAAGTGC TTTGTGCGAG 

1451 TGGAGAGCGA GAAGGGAGCA GTGTGGACCG TAGATGAATT TGAGTTTCGC 

1501 AAGAAGAGGA GCCAACGCCC CAACAAGTGC TCCAATCCCT GCCCTTGACC 

1551 TCAAAACCAA GAAAAGGTGG GCGGGGGAGG GGGCCAAAAC CATGAGACTG 

1601 AGGCTGTGGG GGCAAGGAGG CAAGTCCTAC GTGTACCTAT GGAAACCGGG 

1651 CGATGATGTG CCTGCTATCA GGGCCTCTGC TCCCTATCTA GCTGCCCTCC 

1701 TAGATCATAT CATCTGCCTT ACAGCTGAGA GGGGTGCCAA TCCCAGCCTA 

1751 GCCCCTAGTT CCAACCTAGC CCCAAGATGA ACTTTCCAGT CAAAGAGCCC 

1801 TCACAACCAG CTATACATAT CTGCCTTGGC CACTGCCAAG CAGAAAGATG 

1851 ACAGACACCA TCCTAATATT TACTCAACCC AAACCCTAAA ACATGAAGAG 

1901 CCTGCCTTGG TACATTCGTG AACTTTCAAA GTTAGTCATG CAGTCACACA 

1951 TGACTGCAGT CCTACTGACT CACACCCCAA AGCACTCACC CACAACATCT 

2001 GGAACCACGG GCACTATCAC ACATAGGTGT ATATACAGAC CCTTACACAG 

2 051 CAACAGCACT GGAACCTTCA CAATTACATC CCCCCAAACC ACACAGGCAT 

2101 AACTGATCAT ACGCAGCCTC AAGCAATGCC C^VAAATACAA GTCAGACACA 

2151 GCTTGTCAGA 



Figure I 



MOUSE Fkli^ PROTEIN SEQUENCE 



1 MPNPRPAKEW APSLALGPSP GVLPSWKTAP KGSELLGTRG SGGPFQGRDL 

51 RSGAHTSSSL NPLPPSQLQL PTVPLVMVAP SGARLGPSP5 LQALLQDRPH 

101 FMHQLSTVDA HAQTPVLQVR PliDNPAMISL PPPSAATGVF SLKARPGLPP 

151 GINVASI*EWV SREPAIULCTF PRSGTPRKDS NLLAAPQGSy PLIiANGVCKW 

201 PGCEKVFEEP EEFLKHCQM HLLDEKGKAQ CIiLQRSWQS LEQQLELEKE 

251 KLGAMQAHLA GKMALAKAPS VASMDKSSCC IVATSTQGSV LPAWSAPREA 

301 PDGGLFAVRR HLWGSHGNSS FPEFFHNMDY FKYHNMRPPF TOVTLIRWAI 

351 LEAPERQRTL NEIYHWFTRM FAYFRNHPAT WKNAIRHKLS LHKCFVRVES 

401 EKGAVWTVDE FBFRKKRSQR PNKCSNPCP* 



Figure 2 



HUMAN FKIF^cDNA Sequence 



1 GCACACACTC ATCGAAAA^A ATTTGGATTA TTAGAAGAGA GAGGTCTGCG 

51 GCTTCCACAC CGTACAGCGr-GGTTTTTCTT CTCGGTATAA AAGCAAAGTT 

101 GTTTTTGATA CGTGACAGTT TCCCACAAGC CAGGCTGATC CmTCTGTC 

151 AGTCCACTTC ACCAAGCCTG CCCTTGGACA AGGACCCGAT GCCCAACCCC 

201 AGGCCTGGCA AGCCCTCGGC CCCTTCCTTG GCCCTTGGCC CATCCCCAGG 

251 AGCCTCGCCC AGCTGGAGGG CTGCACCCAA AGCCTCAGAC CTGCTGGGGG 

301 CCCGGGGCCC AGGGGGAACC TTCCAGGGCC GAGATCTTCG AGGCGGGGCC 

351 CATGCCTCCT CTTCTTCCTT GAACCCCATG CCACCATCGC AGCTGCAGCT 

401 GCCCACACTG CCCCTAGTCA TGGTGGCACC CTCCGGGGCA CGGCTGGGCC 

451 CCTTGCCCCA CTTACAGGCA CTCCTCCAGG ACAGGCCACA TTTCATGCAC 

501 CAGCTCTCAA CGGTGGATGC CCACGCCCGG ACCCCTGTGC TGCAGGTGCV 

551 CCCCCTGGAG AGCCCAGCCA TGATCAGCCT CACACCACCC ACCACCGCCA 

601 CTGGGGTCTT CTCCCTCAAG GCCCGGCCTG GCCTCCCACC TGGGATCAAC 

651 GTGGCCAGCC TGGM.TGGGT GTCCAGGGAG CCGGCACTGC TCTCaACCTT 

701 CCCAAATCCC AGTGCACCCA -SGAAGGACAG CACCCTTTCG GCTGTGCCCC 

751 AGAGCTCCTA CCCACTGCTG GCAAATGGTG TCTGCAAGTG GCCCGGATGT 

801 GAGAAGGTCT TCGAAGAGCC AGAGGACTTC CTCAAGCACT GCCAGGCGa^ 

851 CCATCTTCTG GATGAGAAGG 'SCAGGGCACA ATGTCTCCTC CAGAGAGAGA 

901 TGGTACAGTC TCTGGAGCAG C\GCTGGTGC TGGAGAAGGA GAAGCTGAGT 

951 GCCATGCAGG CCCACCTGGC 7GGGAAAATG GCACTGACCA AGGCTTCATC 

1001 TGTGGCATCA TCCGACAAGG 3CTCCTGCTG CATCGTAGCT GCTGGCAGCC 

1051 AAGGCCCTGT CGTCCCAGCC XGGTCTGGCC CCCGGGAGGC CCCTGACAGC 

1101 CTGTTTGCTG TCCGGAGGCA CCTGTGGGGT AGCCATGGAA ACAGCACATT 

1151 CCCAGAGTTC CTCCACAACA TGGACTACTT CAAGTTCCAC AACATGCGAC 

1201 CCCCTTTCAC CTACGCCACG CTCATCCGCT GGGCCATCCT GGAGGCTCCA 

1251 GAGAAGCAGC GGACACTCAA TGAGATCTAC CACTGGTTCA CACGCATGTT 

13 01 TGCCTTCTTC AGAAACCATC -CTGCCACCTG GAAGAACGCC ATCCGCCACA 

1351 ACCTGAGTCT GCACAAGTGC TTTGTGCGGG TGGAGAGCGA GAAGGGGGCT 

1401 GTGTGGACCG TGGATGAGCT GGAGTTCCGC AAGAAACGGA GCCAGAGGCC 

1451 OVGCAGGTGT TCCAACCCTA CACCTGGCCC CTGACCTCAA GATCAAGGAA 

15 01 AGGAGGATGG ACGAACAGGG 3CCAAACTGG TGGGAGGCAG AGGTGGTGGG 

1551 GGCAGGGATG ATAGGCCCTG GATGTGCCCA CAGGGACCAA GAAGTGAGGT 

1601 TTCCACTGTC TTGCCTGCCA GGGCCCCTGT TCCCCCGCTG GCAGCCACCC 

1651 CCTCCCCCAT CATATCCTTT GCCCCAAGGC TGCTCAGAGG GGCCCCGGTC 

1701 CTGGCCCCAG CCCCCACCTC CGCCCCAGAC ACACCCCCCA GTCGAGCCCT 

1751 GCAGCCAAAC AGAGCCTTCA CAACCAGCCA CACAGAGCCT GCCTCAGCTG 

1801 CTCGCACAGA TTACTTCAGG GCTGGAAAAG TCACACAGAC ACACAAAATG 

1851 TCACAATCCT GTCCCTCAC 



Figure 3 



HUMAN FKH^ PROTEIN SEQUENCE 



1 MPNPRPGKPS APSIiAIiGPSP GASPSWRAAP KASDLLGARG PGGTFQGRDI, 

51 RGGAHASSSS LNPMPPSQLQ LPTLPLVMVA PS(3ARIiGPLP HLQALLQDRP 

XOl HFMHQLSTVD AHARTPVLQV HPLESPAMIS I.TPPTTATGV FSLKARPGLP 

151 PGINVASLEW VSREPALLCT FPNPSAPRKD STLSAVPQSS YPLLANGVCK 

201 WPGCEK7FEE PEDFIiKHCQA DHIiLDEKGRA QCLI^REMVQ SLEQQIiVLEK 

251 EKLSAMQAHL AGKMALTKAS SVASSDKGSC CIVAAGSQGP WPAWSGPRE 

301 APDSLFAYRR HLWGSHOTST FPEFLHNMDY FKFHNMRPPF TVATLIRWAI 

351 LEAPEKQRTL NEI'XHWFTHM FAFFRHHPAT WKNAIRHNLS LHKCFVRVES 

401 SKGAWTVDE LEFRKSllSQR PSRCSNPTPG P* 



Figure 4 
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